Power Log’n’Roll: Power-Efficient Localized Rollback for MPI Applications Using Message Logging Protocols

Abstract

In fault tolerance for parallel and distributed systems, message logging protocols have played a prominent role in the last three decades. Such protocols enable local rollback to provide recovery from fail-stop errors. Global rollback techniques can be straightforward to implement but at times lead to slower recovery than local rollback. Local rollback is more complicated but can offer faster recovery times. In this work, we study the power and energy efficiency implications of global and local rollback. We propose a power-efficient version of local rollback to reduce power consumption for non-critical, blocked processes, using Dynamic Voltage and Frequency Scaling (DVFS) and clock modulation (CM). Our results for 3 different MPI codes on 2 parallel systems show that power-efficient local rollback reduces CPU energy waste by up to 50% during the recovery phase, compared to existing global and local rollback techniques, without introducing significant overheads. Furthermore, we show that savings manifest for all blocked processes, which grow linearly with the process count. We estimate that for settings with high recovery overheads, the total energy waste of parallel codes is reduced with the proposed local rollback.

Related articles

Efficient Message Logging for Uncoordinated Checkpointing Protocols

HAL is a multidisciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L'archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau r...

Using Message Semantics to Reduce Rollback in Optimistic Message Logging Recovery Schemes

Recovery from failures can be achieved through asynchronous checkpointing and optimistic message logging. These schemes have low overheads during failure-free operations. Central to these protocols is the determination of a maximal consistent global state, which is recoverable. Message semantics is not exploited in most existing recovery protocols to determine the recoverable state. We propose...

A Fast Rollback-Recovery Scheme based on Optimistic Message Logging

This paper presents an efficient rollback recovery scheme based on optimistic message logging. To speed up the recovery process, the rollback point of the failed process is broadcast and other processes asynchronously make the rollback decision based on the vector time. An asynchronous recovery process usually causes two possible problems: One is the message delivered from an invalid state inter...

Flexible Power Electronic Transformer for Power Flow Control Applications

This paper proposes a Flexible Power Electronic Transformer (FPET) for the application in the micro-grids. The low frequency transformer is usually used at the Point of Common Coupling (PCC) to connect the low voltage grid and utility network to each other. The conventional 50Hz transformer results in enhanced low voltage-grid power management system during grid-connected operation. In this pap...

Consistent Rollback Protocols for Autonomic ASSISTANT Applications

Nowadays, a central issue for applications executed on heterogeneous distributed platforms is represented by assuring that certain performance and reliability parameters are respected throughout the system execution. A typical solution is based on supporting application components with adaptation strategies, able to select at run-time the better component version to execute. It is worth noting ...

Journal

Journal title: IEEE Transactions on Parallel and Distributed Systems

Year: 2022

ISSN: 1045-9219, 1558-2183, 2161-9883

DOI: https://doi.org/10.1109/tpds.2021.3107745